Design of Data-Driven Mathematical Laws for Optimal Statistical Classification Systems
نویسنده
چکیده
This article will devise data-driven, mathematical laws that generate optimal, statistical classification systems which achieve Bayes’ error rate for data distributions with unchanging statistics. Thereby, I will design learning machines that minimize the Bayes’ risk or probability of misclassification. I will devise a system of fundamental equations of binary classification for a classification system in statistical equilibrium. I will use this system of equations to formulate the problem of learning unknown, linear and quadratic discriminant functions from data as a locus problem, thereby formulating geometric locus methods within a statistical framework. Solving locus problems involves finding the equation of a curve or surface defined by a given property and finding the graph or locus of a given equation. I will devise three systems of data-driven, locus equations that generate optimal, statistical classification systems. Each class of learning machines satisfies fundamental statistical laws for a classification system in statistical equilibrium. Thereby, I will formulate three classes of learning machines that are scalable modules for optimal, statistical pattern recognition systems, all of which are capable of performing a wide variety of statistical pattern recognition tasks, where any given M class statistical pattern recognition system exhibits optimal generalization performance for an M -class feature space.
منابع مشابه
تأثیر ساختواژهها در تجزیه وابستگی زبان فارسی
Data-driven systems can be adapted to different languages and domains easily. Using this trend in dependency parsing was lead to introduce data-driven approaches. Existence of appreciate corpora that contain sentences and theirs associated dependency trees are the only pre-requirement in data-driven approaches. Despite obtaining high accurate results for dependency parsing task in English langu...
متن کاملLaws of Large Numbers for Random Linear
The computational solution of large scale linear programming problems contains various difficulties. One of the difficulties is to ensure numerical stability. There is another difficulty of a different nature, namely the original data, contains errors as well. In this paper, we show that the effect of the random errors in the original data has a diminishing tendency for the optimal value as the...
متن کاملMULTICRITERION SYNTHESIS OF INTELLIGENT CONTROL SYSTEMS OF GENERATING UNIT OF NUCLEAR POWER STATION
The paper is devoted to solution of some problems in nuclear power station generating unit intellectual control systems using genetic algorithms on the basis of control system model development, optimizations methods of their direct quality indices and improved integral quadratic estimates. Some mathematical vector models were obtained for control system multicriterion quality indices with due ...
متن کاملSelf-similar solutions of the Riemann problem for two-dimensional systems of conservation laws
In this paper, a new approach is applied to study the self-similar solutions of 2×2 systems of nonlinear hyperbolic conservation laws. A notion of characteristic directions is introduced and then used to construct local and smooth solutions of the associated Riemann problem
متن کاملData mining for decision making in engineering optimal design
Often in modeling the engineering optimization design problems, the value of objective function(s) is not clearly defined in terms of design variables. Instead it is obtained by some numerical analysis such as FE structural analysis, fluid mechanic analysis, and thermodynamic analysis, etc. Yet, the numerical analyses are considerably time consuming to obtain the final value of objective functi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016